Robust Speaker Localization through Ad (AWEPAT) Estim
نویسنده
چکیده
Time delay of arrival (TDOA) estimation between signals input to two or more microphones plays an important role in speaker localization. Most methods employ a linear array of two or more microphones and use the generalized cross correlation method or eigenspace analysis (AEDA) methods. TDOA estimation with linear arrays, however, is highly sensitive to estimation errors when the signals arrive from an endfire direction. In this paper we propose a novel adaptive algorithm which makes use of a three-microphone planar array. This algorithm exhibits a much smaller estimation error over the complete azimuth range of 0–360 degrees as compared to other algorithms. The computational complexity of this approach is comparable to other state-of-the-art algorithms.
منابع مشابه
Oriented global coherence field for the estim smart rooms equipped with distribu
This paper proposes a new method for estimating the talker’s head orientation in a smart room equipped with microphone arrays. The acoustic processing is based on the use of a coherence measure derived from the Cross-power spectrum phase analysis, commonly used for speaker localization and tracking purposes. An Oriented Global Coherence Field function is then introduced to assign to a given poi...
متن کاملRobust speech recognition with speaker localization by a microphone array
This paper proposes robust speech recognition with Speaker Localization by a Arrayed Microphone (SLAM) to realize hands-free speech interface in noisy environments. In order to localize a speaker direction accurately in low SNR conditions, a speaker localization algorithm based on extracting a pitch harmonics is introduced. To evaluate the performance of the proposed system, speech recognition ...
متن کاملClassification of time delay estimates for robust speaker localization
This paper proposes a solution to the problem of robust speaker localization under adverse acoustic conditions. The approach is based on the classification of time delay estimates. Two classification techniques are investigated in detail: maximum likelihood (ML) classification and classification based on histogram comparison. Their performance under adverse acoustic conditions is compared to ou...
متن کاملLearning the Time-delay Manifold for Robust Speaker Localization
We present an algorithm for high dimensional density estimation which is efficient (both computationally and statistically) when the distribution is concentrated close to a low dimensional smooth manifold. The algorithm uses several random projections to generate a hierarchical mixture of Gaussians which rapidly converges to the underlying manifold. We use this algorithm to perform robust estim...
متن کاملLocalization dominance in the median-sagittal plane: effect of stimulus duration.
Localization dominance is an aspect of the precedence effect (PE) in which the leading source dominates the perceived location of a simulated echo (lagging source). It is known to be robust in the horizontal/azimuthal dimension, where binaural cues dominate localization. However, little is known about localization dominance in conditions that minimize binaural cues, and most models of precedenc...
متن کامل